BIOTECHNOLOGY SB a-S 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25 . 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://www.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



ERROR DETECTED 

ATTN: NEW RULES CASES 

1 Wrapped Nucleics 

Wrapped Aminos 



SUGGESTED CORRECTION SERIAL NUMBER: 0?^^ / $ ? 

: PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping." 



\ *s Mi; 



^Invalid Line Length The rules require that a line not exceed 72 characters in length, this includes white spaces. 



11 



12 



Misaligned Amino 
Numbering 

_Non- ASCII 



_Variable Length 



_Patentln 2.0 
"bug" 



The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
use apace characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. 

Sequence^) contain n's or Xaa's representing more thah one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused die <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would* automatically, generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<22 3 > section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

-f^ P ^ T S ^ Ue " CeS Se <l uencc ( s ) missing. If intentional, please insert the following lines for each skipped sequence- 

(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X** is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTIONiSEQ ID NO:X: (insert SEQ'lD NO where "X" is shown) . 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

8 ^Sfl T S f ?S« CeS Se *^ uence ( s ) mis *ing. If intentional, please insert the following lines for each skipped sequence 

(NEW RULES) <210> sequence id number 

<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



_Use of n's or Xaa's 
(NEW RULES) 



10 Invalid <213> 

Response 



_Use of <220> 



_PatentIn 2.0 
"bug- 



Sequence^) missing the <220> "Feature" and associated numeric identifiers and responses 

Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <220> to <223> section 

(See "Federal Register," 06/0 1/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 Misuse of n n can onI y be used t0 represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 

AMC/MH - Biotechnology Systems Branch - OS/2 1/2001 
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OIPE 



RAW SEQUENCE LISTING DATE: 07/26/2001 

PATENT APPLICATION: US/09/903,188 TIME: 14:58:54 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\07262001\I903188.raw 



Does Not Comply 



u, am**. Corrected Diskette Needed 

APPLICANT: De Robertis, Edward M. 

Bouwmeester, Tewis 
TITLE OF INVENTION: Endoderm, Cardiac and Neural Inducing 

Factors 
NUMBER OF SEQUENCES : 10 
CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Majestic, Parsons, Siebert & Hsue 

(B) STREET:' Four Embarcadero Center, Suite 1100 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 94111-4106 
COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1 25 
CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/903,188 

(B) FILING DATE: ll-Jul-2001 

(C) CLASSIFICATION: 
PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/020,150 

(B) FILING DATE: 20-JUN-1996 
ATTORNEY/AGENT INFORMATION: 

(A) NAME: Siebert, J. Suzanne 

(B) REGISTRATION NUMBER: 28,758 

(C) REFERENCE/DOCKET NUMBER: 3100.002US1 
TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415/248-5500 

(B) TELEFAX: 415/362-5418 



7 


(i) 


8 




10 


(ii) 


11 




13 


(iii) 


15 


(iv) 


16 




17 




18 




19 




20 




21 




23 


(v) 


24 




25 




26 




27 




29 


(vi) 


30 




-> 31 




32 




34 


(vii) 


35 




36 




38 


(viii) 


39 




40 




41 




43 


(ix) 


44 




45 






ERRORED SEQUENCES 

330 (2) INFORMATION FOR SEQ ID NO: 

332 (i) SEQUENCE CHAR^ERISTJ_ 

333 (A) LENGTH: Gjy^i^ 

334 (B) TYPE: amino acid 

335 (D) TOPOLOGY: linear 
3 37 (ii) MOLECULE TYPE: peptide 

339 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

341 Met Leu Leu Leu Phe Arg Ala He Pro Met Leu Leu Leu Gly Leu Met 

342 1 5 10 15 

344 Val Leu Gin Thr Asp Cys Glu He Ala Gin Tyr Tyr He Asp Glu Glu 



file://C:\CRF3\Outhold\VsrI903 1 88.htm 
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RAW SEQUENCE LISTING DATE: 07/26/2001 

PATENT APPLICATION: US/09/903 , 188 TIME: 14:58:54 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\07262001\I903188.raw 

345 20 25 30 

347 Glu Pro Pro Gly Thr Val He Ala Val Leu Ser Gin His Ser He Phe 

348 35 40 45 

350 Asn Thr Thr Asp He Pro Ala Thr Asn Phe Arg Leu Met Lys Gin Phe 

351 50 55 60 

353 Asn Asn Ser Leu He Gly Val Arg Glu Ser Asp Gly Gin Leu Ser He 

354 65 70 75 80 

356 Met Glu Arg He Asp Arg Glu Gin He Cys Arg Gin Ser Leu His Cys 

357 85 90 95 

359 Asn Leu Ala Leu Asp Val Val Ser Phe Ser Lys Gly His Phe Lys Leu 

360 100 105 HO 

362 Leu Asn Val Lys Val Glu Val Arg Asp He Asn Asp His Ser Pro His 

363 115 120 125 

365 Phe Pro Ser Glu He Met His Val Glu Val Ser Glu Ser Ser Ser Val 

366 130 135 140 

368 Gly Thr Arg He Pro Leu Glu He Ala He Asp Glu Asp Val Gly Ser 

369 145 150 155 160 

371 Asn Ser lie Gin Asn Phe Gin lie Ser Asn Asn Ser His Phe Ser He 

372 165 170 175 

374 Asp Val Leu Thr Arg Ala Asp Gly Val Lys Tyr Ala Asp Leu Val Leu 

375 180 185 190 

377 Met Arg Glu Leu Asp Arg Glu He Gin Pro Thr Tyr He Met Glu Leu 

378 195 200 205 

380 Leu Ala Met Asp Gly Gly Val Pro Ser Leu Ser Gly Thr Ala Val Val 

381 210 215 220 

383 Asn He Arg Val Leu Asp Phe Asn Asp Asn Ser Pro Val Phe Glu Arg 

384 225 230 235 240 

386 Ser Thr lie Ala Val Asp Leu Val Glu Asp Ala Pro Leu Gly Tyr Leu 

387 245 250 255 

389 Leu Leu Glu Leu His Ala Thr Asp Asp Asp Glu Gly Val Asn Gly Glu 
39 ° 260 265 270 

392 He Val Tyr Gly Phe Ser Thr Leu Ala Ser Gin Glu Val Arg Gin Leu 

393 275 280 285 

395 Phe Lys He Asn Ser Arg Thr Gly Ser Val Thr Leu Glu Gly Gin Val 

396 290 295 300 

398 Asp Phe Glu Thr Lys Gin Thr Tyr Glu Phe Glu Val Gin Ala Gin Asp 

399 305 310 315 320 
401 Leu Gly Pro Asn Pro Leu Thr Ala Thr Cys Lys Val Thr Val His He 

A A n ^ r* 



402 325 



330 335 



404 Leu Asp val Asn Asp Asn Thr Pro Ala He Thr He Thr Pro Leu Thr 



350 



405 340 345 

407 Thr Val Asn Ala Gly Val Ala' Tyr He Pro Glu Thr Ala Thr Lys Glu 

408 355 360 365 

410 Asn Phe He Ala Leu He Ser Thr Thr Asp Arg Ala Ser Gly Ser Asn 

411 370 375 380 

413 Gly Gin Val Arg Cys Thr Leu Tyr Gly His Glu His Phe Lys Leu Gin 

414 385 390 395 400 

416 Gin Ala Tyr Glu Asp Ser Tyr Met He Val Thr Thr Ser Thr Leu Asp 

417 405 410 415 
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RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/903, 188 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\07262001\I903188.raw 



DATE: 07/26/2001 
TIME: 14:58:54 



419 
420 
422 
423 
425 
426 
428 
429 
431 
432 
434 
435 
437 
438 
440 
441 
443 
444 
446 
447 
449 
450 
452 
453 
455 
456 
458 
459 
461 
462 
464 
E--> 465 



467 
E--> 468 
470 
E--> 471 
E--> 473 
780 
782 
783 
784 
785 
787 
789 
791 
E--> 792 
794 
795 
797 



Arg Glu Asn He Ala Ala Tyr Ser Leu Thr Val Val Ala Glu Asp Leu 

420 425 430 

Gly Phe Pro Ser Leu Lys Thr Lys Lys Tyr Tyr Thr Val Lys Val Ser 

435 440 445 

Asp Glu Asn Asp Asn Ala Pro Val Phe Ser Lys Pro Gin Tyr Glu Ala 

450 455 460 

Ser He Leu Glu Asn Asn Ala Pro Gly Ser Tyr He Thr Thr Val He 
465 470 475 480 

Ala Arg Asp Ser Asp Ser Asp Gin Asn Gly Lys Val Asn Tyr Arg Leu 

485 490 495 

Val Asp Ala Lys Val Met Gly Gin Ser Leu Thr Thr Phe Val Ser Leu 

500 505 510 

Asp Ala Asp Ser Gly Val Leu Arg Ala Val Arg Ser Leu Asp Tyr Glu 

515 520 525 

Lys Leu Lys Gin Leu Asp Phe Glu He Glu Ala Ala Asp Asn Gly He 

530 535 540 

Pro Gin Leu Ser Thr Arg Val Gin Leu Asn Leu Arg He Val Asp Gin 
545 550 555 560 

Asn Asp Asn Cys Pro Val He Thr Asn Pro Leu Leu Asn Asn Gly Ser 

565 570 575 

Gly Glu Val Leu Leu Pro He Ser Ala Pro Gin Asn Tyr Leu Val Phe 

580 585 590 

Gin Leu Lys Ala Glu Asp Ser Asp Glu Gly His Asn Ser Gin Leu Phe 

595 600 6 05 

Tyr Thr He Leu Arg Asp Pro Ser Arg Leu Phe Ala He Asn Lys Glu 

610 615 620 

Ser Gly Glu Val Phe Leu Lys Lys Gin Leu Asn Ser Asp His Ser Glu 
625 630 635 640 

Asp Leu Ser He Val Val Ala Val Tyr Asp Leu Gly Arg Pro Ser Leu 
645 650 655 ' " 

Val Ala Asn Arg Met His Ala Glu Tyr Glu Arg Asp Leu Val Asn Arg 

Ser Ala Thr Leu Ser Pro Gin Arg Ser Ser Ser Arg Tyr Gin Glu Phe 

Asn Tyr Ser Pro Gin He Ser Arg Gin Leu His pro Ser Glu He Ala 

T hr Thr Phe n^C 

(2) INFORMATION FOR SEQ ID NO : 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 
Met Val Cys Gly Ser Pro Gly Gly Met Leu Leu Leu Arg Ala Gly Leu 
1 5 & P 15 J 

Leu Ala Leu Ala Ala Leu Cys Leu Leu Arg Val Pro Gly Ala Arg Ala 

20 25 30 

Ala Ala Cys Glu Pro Val Arg He Pro Leu Cys Lys Ser Leu Pro Trp 
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SAW SEQUENCE LISTING DATE : 07/26/2001 

PATENT APPLICATION: US/09/903,188 TIME: 14:58:54 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\07262001\I903188.raw 

798 35 40 45 

800 Asn Met Thr Lys Met Pro Asn His Leu His His Ser Thr Gin Ala Asn 

801 50 55 60 

803 Ala He Leu Ala He Glu Gin Phe Glu Gly Leu Leu Gly Thr His Cys 



804 65 70 
806 



75 80 



Ser Pro Asp Leu Leu Phe Phe Leu Cys Ala Met Tyr Ala Pro He Cys 



807 85 



90 95 



809 Thr He Asp Phe Gin His Glu Pro He Lys Pro Cys Lys Ser Val Cys 



110 



810 100 105 

812 Glu Arg Ala Arg Gin Gly Cys Glu Pro He Leu He Lys Tyr Arg His 

813 H5 120 



125 

815 Ser Trp Pro Glu Asn Leu Ala Cys Glu Glu Leu Pro Val Tyr Asp Arg 

816 130 135 140 

818 Gly Val Cys He Ser Pro Glu Ala He Val Thr Ala Asp Gly Ala Asp 

819 145 150 155 160 

821 Phe Pro Met Asp Ser Ser Asn Gly Asn Cys Arg Gly Ala Ser Ser Glu 

822 165 170 175 

824 Arg Cys Lys Cys Lys Pro He Arg Ala Thr Gin Lys Thr Tyr Phe Arq 

825 180 185 190 

827 Asn Asn Tyr Asn Tyr Val He Arg Ala Lys Val Lys Glu He Lys Thr 

828 195 200 205 

830 Lys Cys His Asp Val Thr Ala Val Val Glu Val Lys Glu He Leu Lys 

831 210 215 220 

833 Ser Ser Leu Val Asn He Pro Arg Asp Thr Val Asn Leu Tyr Thr Ser 

834 225 230 235 240 

836 Ser Gly Cys Leu Cys Pro Pro Leu Asn Val Asn Glu Glu Tyr He He 

837 245 250 255 

839 Met Gly Tyr Glu Asp Glu Glu Arg Ser Arg Leu Leu Leu Val Glu Gly 

840 2 60 265 270 

842 Ser He Ala Glu Lys Trp Lys Asp Arg Leu Gly Lys Lys Val Lys Arg 

843 275 280 285 

845 Trp Asp Met Lys Leu Arg His Leu Gly Leu Ser Lys Ser Asp Ser Ser 

846 290 295 300 

848 Asn Ser Asp Ser Thr Gin Ser Gin Lys Ser Gly Arg Asn Ser Asn Pro 

849 305 310 315 320 

851 Arg Gin Ala Arg Asn 

852 325 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/90 3 , 188 



DATE: 07/26/2001 
TIME: 14:58:55 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\07262001\I903188.raw 



L:30 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
L:31 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
L:465 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:5 
M:332 Repeated in SeqNo-5 

L:473 M:203 E: No. of Seq. differs, LENGTH: Input: 979 Found: 707 SEQ:5 
L:792 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 9 



file://C:\CRF3\Outhold\VsrI903 1 88.htm 



7/26/01 



